Llava - The First Instruction Following Multi-Modal Model